- Title
- Local search-based approach for cost-effective job assignment on large language models
- Creator
- Liu, Yueyue; Zhang, Hongyu; Le, Van-Hoang; Miao, Yuantian; Li, Zhiqiang
- Relation
- GECCO: Genetic and Evolutionary Computation Conference. GECCO '24 Companion: Genetic and Evolutionary Computation Conference Companion (Melbourne, Australia 14-18 July, 2024) p. 719-722
- Relation
- ARC|DP200102940|DP220103044 https://purl.org/au-research/grants/arc/DP200102940
- Publisher Link
- http://dx.doi.org/10.1145/3638530.3654104
- Publisher
- ACM
- Resource Type
- conference paper
- Date
- 2024
- Description
- Large Language Models (LLMs) have garnered significant attention due to their impressive capabilities. However, leveraging LLMs can be expensive due to the computational resources required, with costs depending on invocation numbers and input prompt lengths. Generally, larger LLMs deliver better performance but at a higher cost. In addition, prompts that provide more guidance to LLMs can increase the probability of correctly processing the job but also tend to be longer, increasing the processing cost. Therefore, selecting an appropriate LLM and prompt template is crucial for achieving an optimal trade-off between cost and performance. This paper formulates the job assignment on LLMs as a multi-objective optimisation problem and proposes a local search-based algorithm, termed LSAP, which aims to minimise the invocations cost while maximising overall performance. First, historical data is used to estimate the accuracy of each job submitted to a candidate LLM with a chosen prompt template. Subsequently, LSAP combines heuristic rules to select an appropriate LLM and prompt template based on the invocation cost and estimated accuracy. Extensive experiments on LLM-based log parsing, a typical software maintenance task that utilizes LLMs, demonstrate that LSAP can efficiently generate solutions with significantly lower cost and higher accuracy compared to the baselines.
- Subject
- large language models,; job assignment; local search; log parsing
- Identifier
- http://hdl.handle.net/1959.13/1515624
- Identifier
- uon:56906
- Identifier
- ISBN:979-8-4007-0495-6
- Rights
- This work is licensed under a CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
- Language
- eng
- Full Text
- Reviewed
- Hits: 263
- Visitors: 279
- Downloads: 17
Thumbnail | File | Description | Size | Format | |||
---|---|---|---|---|---|---|---|
View Details Download | ATTACHMENT01 | Publisher version (open access) | 894 KB | Adobe Acrobat PDF | View Details Download |